A Supernodal Cholesky Factorization Algorithm for Shared-Memory Multiprocessors

نویسندگان

  • Esmond G. Ng
  • Barry W. Peyton
چکیده

This paper presents a new left-looking parallel sparse Cholesky fac,torization algorithm for shared-memory MIMD multiprocessors. The algorithm is particularly well-suited for vector supercomputers with multiple processors, such as the Cray Y-MP. The new algorithm uses supernodes in the Cholesky factor to improve performance by reducing indirect addressing and memory traffic. Earlier factorization algorithms have also used supernodes in this manner. The new algorithm, however, also uses supernodes to reduce the number of system synchronization calls, often by an order of magnitude or tnore in practice. Experimental results on a Sequent Balance 8000 and a Cray Y-MP demonstrate the effectiveness of thc new algorithm. On eight processors of a Cray Y-MP, the new routine performs the factorization at rates exceeding one Gflop for several test problems from the Harwell Boeing test collection, none of which are exceedingly large by current standards.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supernodal Symbolic Cholesky Factorization on a Local-Memory Multiprocessor

In this paper, we consider the symbolic factorization step in computing the Cholesky factorization of a sparse symmetric positive definite matrix on distributedmemory multiprocessor systems. By exploiting the supernodal structure in the Cholesky factor, the performance of a previous parallel symbolic factorization algorithm is improved. Empirical tests demonstrate that there can be drastic redu...

متن کامل

Direct and Incomplete Cholesky Factorizations with Static Supernodes

Introduction Incomplete factorizations of sparse symmetric positive definite (SSPD) matrices have been used to generate preconditioners for various iterative solvers. These solvers generally use preconditioners derived from the matrix system, , in order to reduce the total number of iterations until convergence. In this report, we investigate the findings of ref. [1] on their method for computi...

متن کامل

An Efficient Solver for Sparse Linear Systems Based on Rank-Structured Cholesky Factorization

Direct factorization methods for the solution of large, sparse linear systems that arise from PDE discretizations are robust, but typically show poor time and memory scalability for large systems. In this paper, we describe an efficient sparse, rank-structured Cholesky algorithm for solution of the positive definite linear system Ax = b when A comes from a discretized partial-differential equat...

متن کامل

A New Recursive Implementation of Sparse Cholesky Factorization

Consider the Cholesky factorization of a sparse symmetric positive de nite matrix, A = LL . The rst two steps use symbolic, graph-theoretic techniques to order A to reduce ll in L, and to determine the exact sparsity structure of L. The factor L is computed in a third \numeric factorization" step. The two leading schemes for numeric factorization are a blocked column-oriented scheme, and a mult...

متن کامل

Ring-oriented Block Matrix Factorization Algorithms for Shared and Distributed Memory Architectures

Utilizing experiences from the implementations on shared memory multiprocessors (SMM) and distributed memory multicomputers (DMM), general ring-oriented routines are developed for the LU, Cholesky, and QR factorizations. Since, all machine dependencies are comprised to a small set of communication routines, the same factorization routines can be used on both the SMM and DMM architectures. The a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • SIAM J. Scientific Computing

دوره 14  شماره 

صفحات  -

تاریخ انتشار 1993